The construction of amino acid substitution matrices for the comparison of proteins with non-standard compositions

Authors

  • Yi-Kuo Yu
  • Stephen F. Altschul
Abstract

MOTIVATION: Amino acid substitution matrices play a central role in protein alignment methods. Standard log-odds matrices, such as those of the PAM and BLOSUM series, are constructed from large sets of protein alignments that have implicit background amino acid frequencies. However, these matrices are frequently used to compare proteins with markedly different amino acid compositions, such as transmembrane proteins or proteins from organisms with strongly biased nucleotide compositions. It has been argued elsewhere that standard matrices are not ideal for such comparisons, and a rationale has been presented for transforming a standard matrix for use in a non-standard compositional context.

RESULTS: This paper presents the mathematical details underlying the compositional adjustment of amino acid or DNA substitution matrices.
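
To make the phrase "implicit background amino acid frequencies" concrete, the following is a minimal sketch of the standard log-odds form, s_ij = (1/λ) ln(q_ij / (p_i p_j)), in which the background frequencies p_i are the marginals of the target frequencies q_ij. The three-letter alphabet, the numbers in `TOY_Q`, and the names `log_odds_matrix` and `LAMBDA` are illustrative assumptions, not data or notation from the paper, and the sketch shows only how a log-odds matrix is built, not the compositional adjustment itself.

```python
import math


def log_odds_matrix(q, lam=1.0):
    """Return (p, s) for a symmetric target-frequency matrix q.

    p[i] is the background frequency implied by q (its row sum), and
    s[i][j] = ln(q[i][j] / (p[i] * p[j])) / lam is the log-odds score.
    """
    n = len(q)
    p = [sum(row) for row in q]
    s = [
        [math.log(q[i][j] / (p[i] * p[j])) / lam for j in range(n)]
        for i in range(n)
    ]
    return p, s


# Hypothetical target frequencies for a toy 3-letter alphabet (entries sum to 1).
TOY_Q = [
    [0.30, 0.05, 0.05],
    [0.05, 0.20, 0.05],
    [0.05, 0.05, 0.20],
]

# Scale chosen so that scores are in half-bit units (an assumption for this sketch).
LAMBDA = math.log(2) / 2

background, scores = log_odds_matrix(TOY_Q, LAMBDA)

for row in scores:
    print(["%6.2f" % value for value in row])
```

Because the backgrounds are derived from the target frequencies, scoring sequences whose composition differs from `background` uses the matrix outside the context in which it was built, which is the situation the abstract describes.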

Similar resources

The compositional adjustment of amino acid substitution matrices.

Amino acid substitution matrices are central to protein-comparison methods. In most commonly used matrices, the substitution scores take a log-odds form, involving the ratio of "target" to "background" frequencies derived from large, carefully curated sets of protein alignments. However, such matrices often are used to compare protein sequences with amino acid compositions that differ markedly ...

Genome bias influences amino acid choices: analysis of amino acid substitution and re-compilation of substitution matrices exclusive to an AT-biased genome

The genomic era has seen a remarkable increase in the number of genomes being sequenced and annotated. Nonetheless, annotation remains a serious challenge for compositionally biased genomes. For the preliminary annotation, popular nucleotide and protein comparison methods such as BLAST are widely employed. These methods make use of matrices to score alignments such as the amino acid substitutio...

Inconsistent Distances in Substitution Matrices can be Avoided by Properly Handling Hydrophobic Residues

The adequacy of substitution matrices to model evolutionary relationships between amino acid sequences can be numerically evaluated by checking the mathematical property of triangle inequality for all triplets of residues. By converting substitution scores into distances, one can verify that a direct path between two amino acids is shorter than a path passing through a third amino acid in the a...

In situ ion substitution of sodium gluconate: Comparison of bipolar membrane electrodialysis and electro-membrane reactor for producing gluconic acid

Based on the home-made cation-exchange membrane (CEM) and bipolar membrane (BPM), electrodialysis with bipolar membrane (EDBPM) and electro-membrane reactor with three compartments (EMR-3) were developed to achieve in situ ion substitution and recovery of gluconic acid (GLH) from its sodium salt. Physicochemical and electrochemical properties of CEM and BPM were studied to assess their...

Statistical evaluation of pairwise protein sequence comparison with the Bayesian bootstrap

MOTIVATION Protein sequence comparison methods are routinely used to infer the intricate network of evolutionary relationships found within the rapidly growing library of protein sequences, and thereby to predict the structure and function of uncharacterized proteins. In the present study, we detail an improved statistical benchmark of pairwise protein sequence comparison algorithms. We use boo...

Journal:
  • Bioinformatics

Volume 21, Issue 7

Pages: -

Publication date: 2005